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Abstract — The purpose of this paper is to develop further 
the main concepts of Phenomena Dynamic Logic (P-DL) and 
Cognitive Dynamic Logic (C-DL), presented in the previous 
paper. The specific character of these logics is in matching 
vagueness or fuzziness of similarity measures to the uncertainty 
of models. These logics are based on the following fundamen- 
tal notions: generality relation, uncertainty relation, simplicity 
relation, similarity maximization problem with empirical content 
and enhancement (learning) operator. We develop these notions 
in terms of logic and probability and developed a Probabilis- 
tic Dynamic Logic of Phenomena and Cognition (P-DL-PC) 
that relates to the scope of probabilistic models of brain. In 
our research the effectiveness of suggested formalization is 
demonstrated by approximation of the expert model of breast 
cancer diagnostic decisions. The P-DL-PC logic was previously 
successfully applied to solving many practical tasks and also 
for modelling of some cognitive processes. 

I. Introduction 

In the paper [1] there was introduced a Phenomena 
Dynamic Logic (P-DL) and Cognitive Dynamic Logic (C- 
DL) as a generalization of the Dynamic Logic and Neural 
Modelling Fields theory (NMF) introduced in the previous 
papers 0, 0- Logics P-DL, C-DL provide the most general 
description of Dynamic Logic in the following fundamental 
notionsgenerality relation, uncertainty relation, simplicity 
relation, similarity maximization problem with empirical 
content and enhancement (learning) operator. This gener- 
alization provide interpretation of P-DL, C-DL logics in the 
frame of other approaches. 

In this paper we interpret logics P-DL, C-DL in terms of logic 
and probability: uncertainty we interpret as probability, while 
the process of learning as a semantic probabilistic inference 
G), 0, 0, 0. We also interpret mentioned fundamental 
notions. The resulting Probabilistic Dynamic Logic of Phe- 
nomena and Cognition (P-DL-PC) belong to the scope of 
the probabilistic models of brain [19], [20|. Thus, through 
logics P-DL, C-DL we extend the interpretation of Dynamic 
Logic and Neural Modelling Fields theory to probabilistic 
models of brain. The P-DL-PC logic as probabilistic model of 
brain was previously applied to modelling of some cognitive 
process 0, 0, 0, (2TJ. The effectiveness of P-DL-PC logic 
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demonstrated in this paper by approximation of the expert 
model of breast cancer diagnostic decisions. 

II. Universal productions. Data for prediction 

In our study learning models will be generated as sets of 
universal productions (u-productions), which are introduced 
in this section. Note that every set of universal formulas is 
logically equivalent to a certain set of u-productions. 

Consider a fixed first-order language £ in a countable sig- 
nature. Hereafter denote Aj the set of all atoms; L£ - the 
set of all literals; - the set of ground sentences. The set 
of ground atoms and the set of ground literals are denoted 
A°£ f± Ao l~l S° £ and L° & ^ L £ n S% correspondingly. 
Following examples of atoms and literals are given in the 
section VIII for the task of approximation of the expert 
model of breast cancer diagnostic decisions: 'number of 
calcifications per cm 3 less than 20', 'volume of calcifications 
in cm 3 not less or equal to 5', 'total number of calcifications 
more than 30 and etc. 

Let be the set of all substitutions and 0° C the set of 
ground substitutions, that are mappings variables to ground 
terms. All necessary notions from model theory and logic 
programming are elementary and can be easily found in 
books D2I, Ifl3ll. Ifl4t 

Definition. A record of the type 

R i=f V (Ai A • • • A A m <= Bi A • • • A B„) , 

where Aj., • • • A m , Bi, • ■ • , B„ are literals, and V stands for 
a bloc of quantifiers over all free variables of the formulae 
in brackets (universal closure), is called a u-production. A 
variant of u-production R is 

R0 ^ V (Ai0 A • ■ • A m 9 4= BiO A • • • A B n 9) , 

where 6 is an arbitrary one-to-one correspondence over the 
set of variables. Let Prod be the set of all u-productions. 

For example in section X presented the following u- 
production that was discovered by the learning model: 

IF TOTAL number of calcifications is more than 

30, AND VOLUME is more than 5 cm 3 , AND 

DENSITY of calcifications is moderate, 

THEN Malignant. 
Let Fact„ C A^ be a set of atoms from A that are 
valid for verification in algebraic system 03 appearing in 
practice. Our aim is to investigate as much "extra" facts 
about £ as possible, i.e., to predict or explain them. A natural 
assumption is that we can verify (falsify) each element of 

Fact {A6> | 9 e 0°, A e Fact„} . 



Certainly we may postulate our ability to check any literal 
of Fact* <=; Fact„ U {-iA | A G Fact t ,}. For the rest of the 
literals (and their conjunctions) the machinery of probabilis- 
tic prediction will be defined later on. Note that Fact* = 
Fact U Fact^, where Fact^ f± {-A | A G Fact Q } is the 
complete set of alternatives allowing a real test. 

The data are defined as a maximal (logically) consistent 
subset of the complete set of alternatives, i.e., being given 
a mapping £rg : Fact Q h-> {_L, T} (here _L - "false", T - 
"true") we conclude that 

Data [03] ^ {A | A G Fact Q and (<s (A) = T} U 

{-A | A 6 Fact and (<b (A) = _L} . 

III. Generality relation between theories 

The idea of a generality relation between theories can be 
viewed, for example, as a reduction of the set of properties 
predicted by the use of these theories. A more general theory 
(potentially) predicts a greater number of formal features. 
We start with a generality relation between one-element 
specifications, i.e., between u-productions. 

Definition. For two productions Ri = 

V (Ai A • • • A A TOl Bi A • • • A B ni ) and R 2 = 

V (Ci A • • • A C„ l2 <= Dx A • • • A D„ 2 ) a relation Ri y R 2 
("more general than") takes place if and only if there 
exists 8 G such that {R\9, • • , B ni 9} C {Di, ■ • • , D„ 2 }, 
{Axd, ••■ ,A mi 0} D {Ci, • • ■ , C m2 }, and m < n 2 , 
mi ^ m 2 , 1/ Ri = R 2 . 

The inclusion of the sets of premises designates that the more 
general u-production is, then the wider its field of application. 
The inverse inclusion (for conclusions) says that Ri predicts 
a greater number of properties using a smaller premise. 

Let S C Prod. Denote Fact [S; 93] the set of all AgLj 
such that for some R G S and 9 G 6°, 

RB = (Ai A • • • A A m <= Bi A • • • A B„), holds 
{Bi,-- - ,B„} C Data [23] and A G {A x ,-- - ,A m }. 

Thus, Fact [S; 03] is the set of ground literals predicted 
according to available data (about the model 03) together 
with u-productions in S. 

In the sequel let ^ be a reflexive closure of >-. One 
should pay attention to the fact: Ri > R 2 entails that 
Fact [{Ri} ; 03] contains Fact [{R 2 } ; 03]. 

Thereafter it isn't difficult to extend the domain of our 
generality relation to subsets of Prod. 

Definition. Let S, S' C Prod, and for any R' G S' we find 
R G S such that R )? R'. In this case we say 'S is not less 
general than S" (S > S'). 

It's straightforward to notice that Fact [5"; 03] C Fact [S; 03] 
for S and S' from the definition above. Remark that S 
may include u-productions apart from those, which are 
generalizations of elements of S'. 



IV. Probability/degree of belief 

The topic of distributing probability over formulas of prepo- 
sitional logic (as well as over ground statements in a first 
order language) being widely discussed in a literature and 
meets Kolmogorov's understanding of probability measure 
ifTTl . The following definition is given on the basis of analysis 
cited in iflOl . 

Definition. A probability over FCSj closed with respect 
to A, V and -i, is a function p : F \-> [0, 1] satisfying the 
following conditions: 

1) if h $ ("<& is a tautology"), then p ($) = 1; 

2) if I ' ($ A then p ($ V = p ($) + p (*). 

For any ground instance of a u-production its probability 
is defined as conditional, i.e., 

fi (Ai A • • • A m <= Bi A • • • A B„) = 
= n (Ai A • • • A m | Bi A • • • A B„) = 

y(AlA-A m ABiA-ABn) 

m(BiA---ab„) 

Let R G Prod. Denote as Sub [R] p those substitutions G 
0°, for which the premise of u-production R.8 has a non-zero 
probability. 

Prod'' ^{Re Prod | Sub [R] M ^ 0}; 
p(R) ^ inf {p (R6) | 9 G Sub [R] M }, where R G Prod' 1 . 

A value of conditional probability serves to characterize our 
degree of belief (and responsible for an uncertainty relation) 
in reliability of different causal connections included in 
temporary specification. Note that two productions are not 
necessary comparable with respect to generality relation 
moreover, their premisses may not be contained in the 
complete set of alternatives (and so these productions will 
be not valid for a direct check in a real structure 03). 

V. Simplicity of probabilistic theories 

Adding comparison of lower probabilistic estimations to 
the definition of generality relation we obtain the following 
definition. 

Definition. Let S,S' C Prod^. We say that S is more p- 
general than S' iff for every C' G S' there exists C G S 
such that C y C and /u(C) ^ A*(C), and in at least one of 
the cases the strong relation >- takes place. 

Hence, /i-generalization allows us to define a more general 
set S in such a way that the lower estimations of probabilities 
is not declined. When our belief to the elements of S is no 
less than that of S', then we have a simplicity relation - 
the set S is simpler than S' in order to describe/predict the 
properties. 

VI. Similarity measure with the empirical 
content 

By elaboration of u-productions we mean the gain of its 
conditional probability. 



Definition. A relation Ri C R2 ('probabilistic inference') 
for Ri,Ra G Prod' 1 means that Ri y R 2 and /Lt(Ri) < 

m(R 2 ). 

Definition. Let 7r be some requirements to be applied to 
elements of Prod' 1 , i.e. it : Prod/ 1 i-> {_L, T} (value is 
equal to _!_, if u-production satisfies 7r, and T - otherwise); 
fl ^ {Re Prod' 1 I vr (R) = T}. We say that R 2 G n is a 
minimal follower of Ri G Prod M relative to C in l~l (denoted 
as Ri R2), iff Ri C R2 and there is no intermediate 
u-production R 3 / 2 G IT such that Ri □ R 3 / 2 □ R 2 . 

In the prediction of a literal H the similarity measure for u- 
productions, which are valid for verification and applicable 
to the goal H, is equal to conditional probability fx (•). Thus 
we deal with a uniform measure of similarity. 

VII. Learning operator 
Definition. A production 

R = V (Ai A • • • A A m <— Bi A • • • A B„) 

is called a maximal specific u-production (urns-production) 
for prediction of a conjunction H = (Hi A ■ • • A Hfc), where 
{Hi, • • • , Hfc} C L£ and m ^ k, iff the following conditions 
are satisfied: 

1) there is a substitution 9 (not necessary ground) 
such that {Ai, • • • , A TO } C {Hi0, • • • , B k 9}, 
{Bi,...,B„}G{B0|BeFact*}; 

2) if D G {Ai, • • • , A m } and o G Sub [R] M , then 
fx (Ai0 o A ■ • • A A m 9 a ) < 

fx (Ai0 o A • • • A A rn 9 I Bi0 o A ■ • • A B n o ) 
and fx (D0 O ) < fx (D0 O | Bi0 o A • • • A B„0 O ); 

3) there is no R' G Prod' 1 , for which points (1-2) are 
hold along with RcR'; 

4) the u-production R can't be generalized up to some 
R' 6 Prod' 1 satisfying all the previous points (1-3) 
without decreasing its estimation fx (•). 

The conditions above (for corresponding ums-productions) 
are denoted as 'point. i', 1 ^ i ^ 4. 

Remark. Though condition point.4 emphasizes the nature 
of definition, but it isn't necessary for indication. Indeed, if 
R may be generalized up to R' under preserving point. 1- 
3, then fx (R) < fx (R') (otherwise we get R' C R - that 
contradicts point. 3 for R. 

Let 7r (R) = T be fulfilled for R G Prod' 1 iff conditions 
points. 1-2 are satisfied for R and H (the last one is fixed 
from this moment); denote 11 ^ ir^ 1 (T). 

Define the probabilistic fix-point operator : 2 ProdM M> 
2 ProdM as follows: for a set S C Prod'' it produces 

S' ^ {R' I R R' for some R G S} U 
U {R R G S (~1 n and there is no R' such that R R'}. 

Therefore the operator : S H> S' possess important 
properties: 

1) the set S' is always more precise than S (relative to 



2) the conditional probabilities fx (•) increase during the 
conversion to more particular cases (and so fuzziness 
decreases); 

3) the similarity measure with the empirical content be- 
comes greater for at least one u-production (in S) when 
the operator converts S to S' (if not S = S', of course); 

As a result the operator T n is the enhancement, or learning, 
operator in the sense of |fl~). 

Definition. A fix-point (f.p., for short) S of T T is optimal 
iff there is no other f.p. S' of considered operator, which is 
more /i-general than S. 

Statement. A subset S C Prod' 1 is a fix-point of the 
operator T n iff every element of S satisfies points. 1-3 for 
H. 

Corrolary. A subset S C Prod M is an optimal fix-point of 
the operator T n iff every element of S is a urns-production 
for prediction of H. 

Ums-productions may be viewed as a result of performing 
generalized scheme of the semantic probabilistic inference 
[4 1, [5 1, which is realized by the fix-point operator described 
above. The program system 'Discovery' (see lfl6l . ifTTl . [|9l , 
II2TI ) was developed: it carries out the propositional version 
of the probabilistic fix-point (learning) operator and was 
successfully applied to solving many practical tasks [21 J. 

VIII. Extraction of the expert model of breast 

CANCER DIAGNOSTIC DECISIONS 

We applied our method to approximation of the expert 
model of breast cancer diagnostic decisions that was obtained 
from the radiologist J.Ruiz IfTTl . At first we extract this model 
from the expert by the special procedure using monotone 
boolean functions IfTTl and then apply the program system 
'Discovery' lfT6l to approximate this model. 

A. Hierarchical Approach 

At first we ask an expert to describe particular cases using 
the binary features. Then we ask a radiologist to evaluate 
a particular cases, when features take on specific values. A 
typical query will have the following format: "If feature 1 
has value v\, feature 2 has value v%, feature n has value 
v n , then is a case suspicious of cancer or not?" 

Each set of values (vi, u 2 , v n ) represent a possible 
clinical case. It is practically impossible to ask a radiologist 
to generate diagnosis for thousands of possible cases. A hi- 
erarchical approach combined with the use of the property of 
mono tonicity makes the problem manageable. We construct 
a hierarchy of medically interpretable features from a very 
generalized level to a less generalized level. This hierarchy 
follows from the definition of the 11 medically oriented 
binary attributes. The medical expert indicate that the original 
11 binary attributes w\, w 2 , IU3, 2/1, V2, 2/3, 2/4, Us, %4, %5 
could be organized in terms of a hierarchy with development 
of two new generalized attributes x\, depending on attributes 
wi,W2, w 3 , and x 2 , depending on attributes 2/1, 2/2, 2/3, 2/4, 2/5- 



A new generalized feature, x% - 'Amount and volume of 
calcifications' with grades (0 - 'benign' and 1 - 'cancer') 
was introduced based on features: w\ - number of calcifi- 
cations/cm3, w 2 - volume of calcification, cm3 and w 3 - 
total number of calcifications. We view X\ as a function 
g(wi, w 2 , Ws) to be identified. Similarly a new feature x 2 - 
'Shape and density of calcification' with grades: (1) for 'can- 
cer' and (0)- 'benign' generalizes features: y\ - 'irregularity 
in shape of individual calcifications' y 2 - 'variation in shape 
of calcifications' ys - 'variation in size of calcifications' y\ - 
'variation in density of calcifications' 7/5 - 'density of calcifi- 
cations'. We view x 2 as a function x 2 = /i(j/i, 2/2, 2/3,2/4, 2/5) 
to be identified for cancer diagnosis. 

As result we have a decomposition of our task as follows: 

/(a;i,X2,a;3,X4,X5) = 
f(g (wi, W2, w 3 ) ,h (2/1,2/2,2/3, 2/4, 2/5) ,x 3) x 4 ,x 5 ) ■ 

B. Monotonicity 

Giving the above definitions we can represent clinical 
cases in terms of binary vectors with five generalized features 
as: (a;i, X2, £3, Xi, X5). Let us consider two clinical cases 
that are represented by the two binary sequences: (10110) 
and (10100). If radiologist correctly diagnose (10100) as 
cancer, then, by utilizing the property of monotonicity, we 
can also conclude that the clinical case (10110) should 
also be cancer. Medical expert agreed with presupposition 
about monotonicity of the functions / (x\ , x 2 , X3, £4, X5) and 

h (2/1, 2/2, 2/3, 2/4, 2/5). 

Let us describe the interview with an expert using mini- 
mal sequence of questions to completely infer a diagnostic 
function using monotonicity. This sequence is based on fun- 
damental Hansel lemma lfl5l . We omit a detailed description 
of the specific mathematical steps. They can be found in fl8l . 
Table 1 illustrates this. 

C. Expert model extraction 

Columns 2 and 3 present values of above defined 
functions / and h. We omit a restoration of function 
g (wi, w 2 , W3) because few questions are needed to restore 
this function. All 32 possible cases with five binary features 
(xi, x 2 , xs,X4, £5) are presented in column 1 in table 1. They 
are grouped and the groups are called Hansel chains ifTTIl . 
The sequence of chains begins with the shortest chain 1 - 
(01100) < (11100) for five binary features. Then largest 
chain 10 consists of 6 ordered cases: (00000) < (00001) < 
(00011) < (00111) < (01111) < (11111). The chains 
are numbered there from 1 to 10 and each case has its 
number in the chain, e.g., 1.2 means the second case in 
the first chain. Asterisks in columns 2 and 3 mark answers 
obtained from an expert, e.g., 1* for case (01100) in column 
3 means that the expert answered 'yes'. The answers for 
some other chains in column 3 are automatically obtained 
using monotonicity. The value f(01100) = 1 for case 1.1 is 
extended for cases 1.2, 6.3. and 7.3 in this way. Similarly 
values of the monotone Boolean functions h are computed 
using the table 1. The attributes in the sequence (10010) 



are interpreted as 2/1,2/2,2/3,2/4,2/5 f° r me function h instead 
of xi,x 2 ,X3,X4,xs. The Hansel chains are the same if the 
number of attributes is the same five in this case. 

Column 5 and 6 list cases for extending functions' values 
without asking an expert. Column 5 is for extending func- 
tions' values from 1 to 1 and column 6 is for extending them 
from to 0. If an expert gave an answer opposite (f(01 100) = 
0) to that presented in table 1 for function / in the case 1.1, 
then this value could be extended in column 2 for cases 7.1 
(00100) and 8.1 (01000). These cases are listed in column 
5 for case (01100). There is no need to ask an expert about 
cases 7.1 (00100) and 8.1 (01000). Monotonicity provides 
the answer. The negative answer f(01100) = can not be 
extended for f(l 1 100). An expert should be queried regarding 
f(l 1 100). If his/her answer is negative f(l 1 100) = then this 
value can be extended for cases 5.1. and 3.1 listed in column 
5 for case 1.2. Relying on monotonicity, the value of f for 
them will also be 0. 

The total number of cases with asterisk (*) in columns 
2 and 3 are equal to 13 and 12. These numbers show 
that 13 questions are needed to restore the function 
/ (xi, x 2 , X3, X4,, £5) and 12 questions are needed to restore 
the function h (2/1, 2/2, 2/3, 2/4, 2/5) ■ This is only 37.5% of 32 
possible questions. The full number of questions for the 
expert without monotonicity and hierarchy is 2 11 = 2048. 

IX. Approximation of the expert model by 

LEARNING OPERATOR 

For the Approximation of the expert model we used the 
program system 'Discovery' II 1 611 . that realizes the proposi- 
tional case of the probabilistic fix-point learning operator. 
We discovered several dozens diagnostic rules that were 
statistically significant on the 0.01, 0.05 and 0.1 levels 
of (F-criterion). Rules were extracted using 156 cases (73 
malignant, 77 benign, 2 highly suspicious and 4 with mixed 
diagnosis). In the Round-Robin test our rules diagnosed 134 
cases and refused to diagnose 22 cases. The total accuracy 
of diagnosis is 86%. Incorrect diagnoses were obtained in 
19 cases (14% of diagnosed cases). The false-negative rate 
was 5.2% (7 malignant cases were diagnosed as benign) 
and the false-positive rate was 8.9% (12 benign cases were 
diagnosed as malignant). Some of the rules are shown in table 
2. This table presents examples of discovered rules with their 
statistical significance. In this table: 

• 'NUM' - number of calcifications per cm 3 ; 

• 'VOL - volume in cm 3 ; 

• 'TOT' - total number of calcifications; 

• 'DEN' - density of calcifications; 

• 'VAR' - variation in shape of calcifications; 

• 'SIZE' - variation in size of calcifications; 

• TRR' - irregularity in shape of calcifications; 

• 'SHAPE' - shape of calcifications. 

We studied three levels of similarity measure: 0.7, 0.85 
and 0.95. A higher level of conditional probability decreases 
the number of rules and diagnosed patients, but increases 
accuracy of diagnosis. 



Table 1 . Dynamic sequence of questions to expert 



1 


2 


3 


4 


5 


6 


7 


Number 


/ 

Diagnose 


h 

Form and V 


Monotonic extrapolation 


Chain 


Case 


1 1 — y 1 


0^0 


(01100) 


1* 


1* 


1.2, 6.3, 7.3 


7.1, 8.1 


Chain 1 


1.1 


(11100) 


1 


1 


6.4, 7.4 


5.1, 3.1 




1.2 


(01010) 


0* 


1* 


2.2, 6.3, 8.3 


6.1, 8.1 


Chain 2 


2.1 


(11010) 


1* 


1 


6.4, 8.4 


3.1, 6.1 




2.2 


(11000) 


1* 


1* 


3.2 


8.1, 9.1 


Chain 3 


3.1 


(11001) 


1 


1 


7.4, 8.4 


8.2, 9.2 




3.2 


(10010) 


0* 


1* 


4.2, 9.3 


6.1, 9.1 


Chain 4 


4.1 


(10110) 


1* 


1 


6.4, 9.4 


6.2, 5.1 




4.2 


(10100) 


1* 


1* 


5.2 


7.1, 9.1 


Chain 5 


5.1 


(10101) 


1 


1 


7.4, 9.4 


7.2, 9.2 




5.2 


(00010) 





0* 


6.2, 10.3 


10.1 


Chain 6 


6.1 


(00110) 


1* 


0* 


6.3, 10.4 


7.1 




6.2 


(oino) 


1 


1 


6.4, 10.5 






6.3 


(11110) 


1 


1 


10.6 






6.4 


(00100) 


1* 


0* 


7.2, 10.4 


10.1 


Chain 7 


7.1 


(00101) 


1 


0* 


7.3, 10.4 


10.2 




7.2 


(01101) 


1 


1* 


7.4, 10.5 


8.2, 10.2 




7.3 


(11101) 


1 


1 


5.6 






7.4 


(01000) 





1* 


8.2 


10.1 


Chain 8 


8.1 


(01001) 


1* 


1 


8.3 


10.2 




8.2 


(01011) 


1 


1 


8.4 


10.3 




8.3 


(11011) 


1 


1 


10.6 


9.3 




8.4 


(10000) 





1* 


9.2 


10.1 


Chain 9 


9.1 


(10001) 


1* 


1 


9.3 


10.2 




9.2 


(10011) 


1 


1 


9.4 


10.3 




9.3 


i i m '\ '\ \ 
(1U111 ) 


1 

1 


1 

1 


10.6 


10.4 




y.4 


(00000) 








10.2 




Chain 10 


10.1 


(00001) 


0* 





10.3 






10.2 


(00011) 


1* 





10.4 






10.3 


(00111) 


1 


1* 


10.5 






10.4 


(01111) 


1 


1 


10.6 






10.5 


(11111) 


1 


1 








10.6 


Questions 


13 


12 











Table 2. Examples of discovered diagnostic rules 



Diagnosis 


/-criteria 


Value, /-criteria 


Precision 


rule 






0.01 


0.05 


0.1 


on control 


If 10 < NUM < 20 


NUM 


0.0029 


+ 


+ 


+ 


93.3% 


and VOL > 5 


VOL 


0.0040 


+ 


+ 


+ 




then malignant 














If TOT > 30 


TOT 


0.0229 




+ 


+ 


100.0% 


and VOL > 5 


VOL 


0.0124 




+ 


+ 




and DEN is moderate 


DEN 


0.0325 




+ 


+ 




then malignant 














If VAR is marked 


VAR 


0.0044 


+ 


+ 


+ 


100.0% 


and 10 < NUM < 20 


NUM 


0.0039 


+ 


+ 


+ 




and IRR is moderate 


IRR 


0.0254 




+ 


+ 




then malignant 














If SIZE is moderate 


SIZE 


0.0150 




+ 


+ 


92.86% 


and SHAPE is mild 


SHAPE 


0.0114 




+ 


+ 




and IRR is mild 


IRR 


0.0878 






+ 




then benign 















Results for them are marked as Discovery 1, Discovery2 
and Discovery3. We extracted 44 statistically significant 
diagnostic rules for 0.05 level of F -criterion with a con- 
ditional probability no less than 0.75 (Discoveryl). There 
were 30 rules with a conditional probability no less than 
0.85 (Disco very2) and 18 rules with a conditional probability 
no less than 0.95 (Discovery3). The most reliable 30 rules 
delivered a total accuracy of 90%, and the 18 most reliable 
rules performed with 96.6% accuracy with only 3 false 
positive cases (3.4%). 

X. Decision rule (model) extracted from the 

EXPERT THROUGH MONOTONE BOOLEAN FUNCTIONS 

We obtained Boolean expressions for function 
^(2/1,2/2,2/3,2/4,2/5) ('shape and density of calcification') 
from the information depicted in table 1 with the following 
steps: 

- Find all the maximal lower units for all chains as 
elementary conjunctions; 

- Take the disjunction of obtained conjunctions; 

- Exclude the redundant terms (conjunctions) from the 
end formula. 

Using 1 and 3 columns we have 

X2 = M2/1j2/2,2/3,2/4,2/s) = 2/22/3Vy22/4Vyiy 2 Vyiy4V2/iy 3 V 
V 2/22/32/5 V y 2 V yi V ?/32/42/5 = 2/2 V z/i V 2/32/42/5- 

Function g (wi, 11)3,003) — w 2 V W1W3 we may obtain by 
direct 2 3 = 8 questions for the expert. 
Using 1 and 2 columns we have 

/ (a?) = X2X3 V X1X2X4 V x\x 2 V X1X3X4 V X1X3 V X3X4 V X3 

VX2X5 V X1X5 V X4X5 = X\X 2 VI3 V (x 2 XiX4) X5 = 

(w 2 V W1W3) (j/i V y 2 V 2/32/42/5) V13V 
V (2/1 V 7/2 V 2/32/42/5) (w2 V W1W3) X4X5. 

XI. Comparison of the expert model with its 

APPROXIMATION BY LEARNING OPERATOR 

For compare rules discovered by the learning operator 
(Discovery system) with the expert model we asked the 
expert to evaluate this rules. Below we present some rules, 
discovered by Discovery system, and radiologists comments 
regarding these rules as approximation of his model. 
IF TOTAL number of calcifications is more than 
30, AND VOLUME is more than 5 cm 3 , AND 
DENSITY of calcifications is moderate, 
THEN Malignant, 
/-criterion significant for 0.05. Accuracy of diagnosis for 
test cases - 100%. Radiologist's comment - this rule might 
have promise, but I would consider it risky. 

IF VARIATION in shape of calcifications is 
marked, AND NUMBER of calcifications is be- 
tween 10 and 20, AND IRREGULARITY in shape 
of calcifications is moderate, 
THEN Malignant. 



/-criterion significant for 0.05. Accuracy of diagnosis for 
test cases - 100%. Radiologist's comment - I would trust 
this rule. 

IF variation in SIZE of calcifications is moderate, 
AND variation in SHAPE of calcifications is mild, 
AND IRREGULARITY in shape of calcifications 
is mild, THEN Benign, 
/-criterion significant for 0.05. Accuracy of diagnosis for 
test cases - 92.86%. Radiologist's comment - I would trust 
this rule. 
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